Skip to content

Hypvector 0.1.1#66

Merged
platypii merged 3 commits into
masterfrom
hypvector011
May 26, 2026
Merged

Hypvector 0.1.1#66
platypii merged 3 commits into
masterfrom
hypvector011

Conversation

@platypii

Copy link
Copy Markdown
Contributor
  • Use prefetchBinary to pull the ~7.5 MB binary column into RAM at load, so every query skips phase-1 fetches
  • Move the instrumentation wrapper below cachedAsyncBuffer so counters reflect real network reads, not cache hits
  • Track peak in-flight concurrency and surface it in the stats bar
  • Bump dataset from 50k to 156k English Wikipedia titles (docs + UI label)

platypii added 3 commits May 25, 2026 15:19
Wrap the raw HTTP-backed AsyncBuffer rather than the cached layer so the
stats bar reports actual network fetches (not cache hits). Adds two new
metrics that surface the per-query bottleneck:

  net wait:       summed time across all in-flight slice() calls
  max concurrent: peak in-flight requests (confirms h3 parallelism)

The previous counter wrapped the cached buffer, so cache hits were
counted as fetches and the numbers were inflated 2x.
Bump hypvector dep to 0.1.1 and call prefetchBinary at page load so
phase-1 Hamming scans run from memory. Cuts ~30 fetches per query and
significantly tightens p90 over the CloudFront/WAN cold path.

The 7.5 MB binary column download happens once during the 'loading
vector index...' status, then every subsequent query skips its
binary parquet fetches entirely.
@platypii platypii merged commit a460d5c into master May 26, 2026
4 checks passed
@platypii platypii deleted the hypvector011 branch May 26, 2026 00:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant